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Abstract. Distributions of the size of the largest component, in particular the large-deviation tail, are 
studied numerically for two graph ensembles, for Erdos-Renyi random graphs with finite connectivity 
and for two-dimensional bond percolation. Probabilities as small as 10 -180 are accessed using an artifi- 
cial finite-temperature (Boltzmann) ensemble. The distributions for the Erdos-Renyi ensemble agree well 
with previously obtained analytical results. The results for the percolation problem, where no analytical 
results are available, are qualitatively similar, but the shapes of the distributions are somehow different 
and the finite-size corrections are sometimes much larger. Furthermore, for both problems, a first-order 
phase transition at low temperatures T within the artificial ensemble is found in the percolating regime, 
respectively. 
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1 Introduction 

For many problems in science and in statistics, the large 
deviation properties play an important role |1I2) . Only 
for few cases analytical results can be obtained. Thus, 
most problems have to be studied by numerical simula- 
tions [5], in particular by Monte Carlo (MC) techniques 
; |4l5j . Classically, MC simulation have been applied to ran- 
dom systems in the following way: For a finite set of in- 
dependently drawn quenched random instances regular or 
large-deviation properties of these instances have been cal- 
culated using importance-sampling MC simulations. Only 
recently it has been noticed that by introducing an arti- 
ficial sampling temperature also the large-deviation prop- 
erties with respect to the quenched random ensemble can 
be obtained [3] . This corresponds somehow to an annealed 
average, but the results are re- weighted in a way that the 
results for the original quenched ensemble are obtained. In 
this way, the large-deviation properties of the distribution 
of alignment scores for protein comparison was studied 
[61718] , which is of importance to calculate the significance 
of results of protein-data-base queries [5] . 

Motivated by these results, similar approaches have 
been applied to other problems like the distribution of 
the number of components of Erdos-Renyi (ER) random 
graphs 10J, the partition function of Potts models [TTj . 
the distribution of ground-state energies of spin glasses 
[12] and of directed polymers in random media [15] . the 
distribution of Lee- Yang zeros for spin glasses [T3] , the dis- 
tribution of success probabilities of error-correcting codes 
[15] . the distribution of free energies of RNA secondary 
structures [TB] , and some large-deviation properties of ran- 
dom matrices [17118] . 



Interestingly, so far no comparison between numerical 
and mathematically exact results for the full support of a 
distribution involving a large-deviation tail has been per- 
formed to the knowledge of the author. In few cases, nu- 
merical results had been compared to rigorous results [13 , 
but only near the peak of the distribution, where finite- 
size effects are often very small. In another case [TU], the 
numerical and analytical distributions were compared on 
the full support, but the result was obtained using a non- 
rigorous statistical mechanics approach. In this work, the 
numerical large-deviation approach is applied to obtain 
the complete distribution of the size S of the largest com- 
ponent of ER random graphs. In this case also mathemat- 
ically exact results for the leading order large-deviation 
rate function are available for arbitrary values of the fi- 
nite connectivity c. This allows for a comprehensive com- 
parison and an estimation of the strength of finite-size 
effects. Furthermore, in this paper results are obtained for 
the two-dimensional (2d) percolation problem, where no 
analytic results are available for the distribution of the 
largest-component size S. For both models also the de- 
pendence of S on the artificial sampling temperature T is 
studied. First-order phase transitions are found for both 
graph ensembles in the percolating regime. 

The two graph ensembles are defined as follows. In 
both cases, each graph G = (V, E) consists of N nodes 
i <E V and undirected edges {i,j} £ E C V^K For ER 
random graphs [T5], each possible edge {i,j} is present 
with probability c/N . Hence, the average degree (connec- 
tivity) is c. For 2d percolation, the graph is embedded in 
a two-dimensional square lattice of size N = L x L with 
periodic boundary conditions in both directions, i.e., in a 
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torus. Each node can be connected by edges to its four 
nearest neighbours, each edge is present with probabilty 
p. Hence, the average deegree is 4p. 

Two nodes i,j are called connected if there exist a 
path of disjoint edges {iq, ii}, {ii, 12}, ■ ■ ■ , {h-i, ii} such 
that i = ia and j = i\. The maximum-size subsets C C V 
of nodes, such that all pairs i,j G C are connected are 
called the (connected) components of a graph. The size 
of the largest component of a graph is denoted here by 
5*. Via the random-graph enembles, a probability distri- 
bution P(S) for the size of the largest component and the 
corresponding probability P(s) for relative sizes s — S/N 
are defined. The probabilities P(s) for values of s different 
from the typical size are exponentiall small in N. Hence, 
one uses the concept of the large-deviation rate function 
[T] by writing 



P(s) 



-N&(s)+o(N) 



(N 



00 



(1) 



This leading-order behavior of the large-deviation rate 
function $er(s, c) for ER random graphs with connectiv- 
ity c is known exactly [201 and given by the following set 
of equations 



S(s) = slogs + (1 - s)log(l - s) 
7i"i (en) = l-e~ a 

V(a) = (log a — 0.5[a — 1/a]) A 
^er(s,c) = S(s) - slogTri (cs) - (1 - s)log(l - tti(cs)) 

-(1- S )#(c(l-*)) , (2) 

where the expression 17(a) A results in if g(a) > and 
in g(a) else. 

For percolation in finite dimensions similar results are 
not available, to the knowledge of the author. There are 
only analytical results for the distribution of finite (non- 
percolating) components |21) . 

The paper is organised as follows. In the second sec- 
tion, the numerical simulation technique and the corre- 
sponding re- weighting approach are explained. In the third 
section, the results are displayed, first for the ER random 
graph ensemble, next for the 2d percolation problem. Fi- 
nally, a summary and an outlook are given. A concise sum- 
mary of this paper is available at the papercore web page 
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2 Simulation and reweighting method 

To determine the distribution P(S) for any measurable 
quantity S, here denoting the largest component for an 
ensemble of graphs, simple sampling is straightforward: 
One generates a certain number K of graph samples and 
obtains S(G) for each sample G. This means each graph 
G will appear with its natural ensemble probability Q(G). 
The probability to measure a value of S is given by 



P(S) = J2Q(G)5 s{G ),s 



(3) 



Therefore, by calculating a histogram of the values for S, a 
good estimation for P(S) is obtained. Nevertheless, P(S) 
can only be measured in a regime where P(S) is relatively 
large, about P(S) > 1/K. Unfortunately, the distribution 
decreases exponentially fast in the system size N when 
moving away from its typical (peak) value. This means, 
even for moderate system sizes N , the distribution will be 
unknown on almost its complete support. 

To estimate P(S) for a much larger range, even possi- 
bly on the full support of P(S), where probabilities smaller 
than 10~ 100 may appear, a different approach is used [5]. 
For self-containedness, the method is outlined subsequently. 
The basic idea is to use an additional Boltzmann factor 
exp(— S(G)/T), T being a "temperature" parameter, in 
the following manner: A standard Markov- chain MC sim- 
ulation |4I5) is performed, where in each step t from the 
current graph G{t) a candidate graph G* is created: A 
node i of the current graph is selected randomly, with uni- 
form weight 1/N, and all adjacent edges are deleted. For 
all feasible edges {i,j}, the edge is added with a weight 
corresponding to the natural weight Q(G), i.e., with prob- 
ability c/N (ER random graph) or with probability p (per- 
colation), respectively. For the candidate graph, the size 
S(G*) of the largest component is calculated. Finally, the 
candidate graph is accepted, (G(t + 1) = G*) with the 
Metropolis probability 



PMct = mm 



{i.. 



-[S(G')S(G(t))]/T\ 



(4) 



Otherwise the current graph is kept (G(t + 1) = G(t)). 
By construction, the algorithm fulfills detailed balance. 
Clearly the algorithm is also ergodic, since within N steps, 
each possible graph may be constructed. Thus, in the limit 
of infinite long Markov chains, the distribution of graphs 
will follow the probability 



q T (G) = 



1 



Z{T) 



Q(G)t 



-S(G)/T 



(5) 



where Z(T) is the a priori unknown normalisation factor. 
The distribution for S at temperature T is given by 



Pt(S) = ^2qT(G)6 S (G),s 

G 

J5J 



^J2Q( G >~ S(G)/TS S(G h S 



G 

e ~S/T 



Z{T) 

m e- s / T 



P(S) 



Z{T) 

P{S) = e s / T Z(T)P T (S) 



(0) 



G 



Hence, the target distribution P(S) can be estimated, up 
to a normalisation constant Z{T), from sampling at finite 
temperature T . For each temperature, a specific range of 
the distribution P(S) will be sampled: Using a positive 
temperature allows to sample the region of a distribution 



A. K. Hartmann: Large-deviation properties of largest component for random graphs 



3 



left to its peak (values smaller than the typical value), 
while negative temperatures are used to access the right 
tail. Temperatures of large absolute value will cause a sam- 
pling of the distribution close to its typical value, while 
temperatures of small absolute value are used to access 
the tails of the distribution. Hence, by choosing a suitable 
set of temperatures, P(S) can be measured over a large 
range, possibly on its full support. 

The normalisation constants Z{T) can easily be ob- 
tained by including a histogram obtained from simple sam- 
pling, which corresponds to temperature T — ±00, which 
means Z w 1 (within numerical accuracy). Using suitably 
chosen temperatures 7+1, T_i, one measures histograms 
which overlap with the simple sampling histogram on its 
left and right border, respectively. Then the corresponding 
normalisation constants Z(T±\) can be obtained by the re- 
quirement that after rescaling the histograms according to 
([6]), they must agree in the overlapping regions with the 
simple sampling histogram within error bars. This means, 
the histograms are "glued" together. In the same manner, 
the range of covered S values can be extended iteratively 
to the left and to the right by choosing additional suit- 
able temperatures 7+2>7±3, . . . and gluing the resulting 
histograms one to the other. A pedagogical explanation 
and examples of this procedure can be found in Ref. |23j . 

In order to obtain the correct result, the MC simu- 
lations must be equilibrated. For the case of the distri- 
bution of the size of the largest component, this is very 
easy to verify: The equilibration of the simulation can be 
monitored by starting with two different initial graphs, 
respectively: 

— Either an unbiased random graph is taken, which means 
that the largest component is of typical size. In the in- 
set of Fig. [T] the evolution of S as a function of the 
number £mcs = t/N of Monte Carlo sweeps is shown 
for Erdos-Reny random graphs with N — 500 nodes, 
connectivity c = 0.5 at temperature T = 2. As one 
can see, S(iMCs) moves quickly away from the typical 
size which is around S — 30 towards a values around 
S = 200. This shows that easily different parts of the 
distribution can be addressed. The result of a second 
run with a negative temperature is shown in the same 
inset. In this case an initial graph was used which con- 
sists of a single line of nodes, i.e., in particular the 
graph is fully connected leading to S = N. 

— Alternatively, if the temperature is positive, one can 
start with an empty graph (S — 1). In any case, for 
the two different initial conditions, the evolution of 
S^mcs) w iU approach from two different extremes, 
which allows for a simple equilibration test: equili- 
bration is achieved if the measured values of S agree 
within the range of fluctuations. Only data was used 
in this work, where equilibration was achieved within 
200 Monte Carlo steps. 

The resulting distribution for ER random graphs (C = 
2, N = 500) is shown in the main plot of Fig.Q] As one can 
see, the distribution can be measured over its full support 
such that probabilities as small as 10 -180 are accessible. 
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Fig. 1. Distribution of the size S of the largest com- 
ponent for Erdos-Renyi random graphs of size N = 500 
at connectivity c = 0.5. In this and all other plots, error 
bars are of symbol size or smaller if not explicitly shown. 
The inset shows the size of the largest component as func- 
tion of the number tMcs of Monte Carlo sweeps for the 
same type of graphs at temperature T = —2. Two dif- 
ferent starting conditions are displayed: either a random 
graph (leading to a typical components size around 30) or 
a graph consisting of a line (S — 500) were used. 

Note that in principle one can also use a Wang-Landau 
approach [24 or similar approaches to obtain the distribu- 
tion P(S) without the need to perform independent simu- 
lations at different values for the temperatures. Neverthe- 
less, the author has performed tests for ER random graphs 
and experienced problems by using the Wang-Landau ap- 
proach, because the sampled distributions tend to stay 
in a limited fraction of the values of interest. Using the 
finite-temperature approach it is much easier to guide the 
simulations to the regions of interest, e.g., where data is 
missing using the so-far-obtained data, and to monitor the 
equilibration process. Furthermore, the behavior of S as 
a function of T appears to be of interest on its own, see 
next section. 



3 Results 

ER random graphs of size N — 500 and 2d percolation 
problems with lateral size L = 32 (N = 1024 sites) were 
studied. In few cases, additional system sizes were consid- 
ered to estimate the strength of finite-size effects, sec be- 
low. For each problem, the model was studied right at the 
percolation transition, for one point in the non-percolating 
regime, and for one point in the percolating regime. The 
temperature ranges used for the different cases are shown 
in Tab. [1] Note that, depending on the position of the 
peak of the size distribution, sometimes positive, some- 
times negative and sometimes both types of temperatures 



4 



A. K. Hartmann: Large-deviation properties of largest component for random graphs 



system 


Ti 


T 2 


N T 


ER c = 


0.5 


-5 


-0.4 


14 


ER c = 


1.0 


-7.0 


-0.6 


9 


ER c = 


2.0 


-2.0 


10.0 


6 


perc p = 


= 0.3 


-10.0 


-0.6 


13 


perc p = 


= 0.5 


-5.00 


50.0 


13 


perc p = 


= 0.6 


0.60 


30.0 


14 



Table 1. Parameters used to determine the distributions 
P(S) for the different models. T± is the minimum and Ti 
the maximum temperature used. Nt denotes the number 
of different temperature values. Note that for determinin- 
ing the average value S(T), see Figs. 2] and El usually a 
higher number of temperatures was used. 



had to be used. For the systems listed in the table, equili- 
bration was always achieved within the first 200 MCS. In 
general, studying significantly larger sizes or going deeper 
into the percolation regime makes the equilibration much 
more difficult. After equilibration, data was collected for 
9800 MCS, in some cases, to improve statistics, for about 
10 6 MCS. In the two subsequent subsections, the results 
for ER random graphs and for 2d percolation are pre- 
sented, respectively. 
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Fig. 2. Large-deviation rate function <P(s) of ER random 
graphs with average connectivity c — 0.5 < c c , N = 500 
(symbols). The line displays the analytical result from Eq. 
@. The inset shows the same for the case c = 1.0 = c c . 



3.1 ER graphs 

For the numerical simulations, first the case of ER random 
graphs is treated, because the analytical result can be 
used for comparison. This allows to assess the quality of 
the method and to get an impression of influence of the 
non-leading finite-size corrections. 

In Fig. [2] the empirical rate function 

= log P( fl ) (7) 

for c = 0.5 is displayed, corresponding to the distribution 
shown in Fig. [T] Note that by just stating the analytical 
asymptotic rate function <?er(s, c), the corresponding dis- 
tribution P(s) is not normalised. Hence, for comparison, 
<P(s) is shifted for all values of the connectivity c such that 
it is zero at its minimum value, like $er(s,c). 

The numerical data agrees very well with the analytic 
result. Only in the region of intermediate cluster sizes, a 
small systematic deviation is visible, which is likely to be a 
finite-size effect. Given that for the numerical simulations 
only graphs with N = 500 nodes were treated, the agree- 
ment with the N — > oo leading-order analytical result is 
remarkable. 

The resulting rate function right at the percolation 
transition c = c c = 1 is shown in the inset of Fig. [2] Qual- 
itatively, the result is very similar to the non-percolating 
case c = 0.5, except that the distribution is much broader, 
corresponding to smaller values of the rate function. Again, 
the agreement with the analytical result is very good, ex- 
cept for the data close to the origin s = 0: The numerical 
results exhibit a minimum near s = 0.05, while the analyt- 
ical result exhibits its minimum naturally at s = 0. This 
is clearly due to the finite size of the numerical samples. 



The case of the percolating regime (connectivity c = 
2), is displayed in Fig. [3] The rate function exhibits a 
minimum at a finite value of s, corresponding to the finite 
average fraction of nodes contained in the largest compo- 
nent. The behaviour of the rate function is more interest- 
ing compared to c < c c , because ^er(s) grows strongly 
near its minimum, but for s — > it levels off horizontally. 
For most of the support of the distribution, the numer- 
ical data for N — 500 agrees again very well with the 
analytic result. Nevertheless, for s — > 0, strong deviations 
become visible because the numerical rate function $(s) 
grows strongly as s — > 0. By comparing with the result 
for a smaller system, N — 100, where this deviation is 
even larger, it becomes clear that this is a finite-size ef- 
fect, corresponding to the non-leading corrections, which 
will disappear for N — > oo. 

Although the parameter T is mainly used as a way to 
address different parts of the distribution, the temperature 
dependence of S, which is shown in Fig. HJ exhibits an in- 
teresting behaviour on its own in the percolating regime. 
When studying the average size s as a function of tem- 
perature, a strong increase around temperature T = 9.5 
becomes visible, which may correspond to a kind of phase 
transition, see below. 

On the other hand, in the non-percolating regime c < 
1, the average size of the largest component is rather 
smooth, as shown in the inset of Fig. [U Note that here 
negative artificial temperatures are used, because the peak 
of the distribution P(S) is close to S = 0, in contrast to 
the percolating regime c > 1, where the peak of P(S) is 
at finite values. 

In Fig. [5] the size S of the largest component is shown 
as a function of the number of MC sweeps for a temper- 
ature T = 9.5, which is located in the regime of the as- 
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Fig. 3. Large-deviation rate function #(s) of ER random 
graphs with average connectivity c = 2 > c c , N = 100 
and N = 500 (symbols). The line displays the analytical 
result from Eq. @ . 
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Fig. 4. Average relative size s of the largest component 
as a function of artificial temperature T for ER random 
graphs in the percolating regime (c = 2) and (inset) in the 
non-percolating regime (c = 0.5). 



sumed transition. One can see that S(t) fluctuates quickly 
between two sets of typical sizes, which shows also that 
the data is well equilibrated. In particular, values around 
S = 350 and around S = 100 are more frequent than 
intermediate values. 

This result is made more quantitative by studying the 
resulting distribution of the sizes of the largest component, 
see Fig. [5] A two-peak structure can be observed, which 
indicates that indeed a transition between small and large 



500 



400 



300 



-i — i — i — | — i — i — i — i — | — i — i — i — i — | — i — i — i — i — | — i — i — r- 

ER random graphs 



c=2, N=500, T=9.5 



t>3 




200 - 



ioo r 



10000 



MCS 



Fig. 5. Time-series for the size S of largest cluster as 
function of the number £mcs of MC sweeps for ER random 
graphs (c = 2, N = 500) at artificial temperature T = 9.5. 



components of first-order type is present in the percolating 
regime c > 1. Similar first-order transitions have been ob- 
served for biased simulations of the one-dimensional Ising 
model 25 26 . The first-order nature of this transition is a 
reason that obtaining $(s) becomes harder for large sys- 
tem sizes, because the time for tunnelling between the 
two sets of values grows quickly. Furthermore, even worse, 
the number of observed configurations having a value of 
S which is located between the peaks decreases strongly, 
such that for large intervals no data can be collected at 
all, already for N = 1000. Hence, P(S) cannot be sam- 
pled for such system sizes on its complete support, because 
the different parts of the distribution cannot be "glued" 
together. 

A detailed study of this phase transition is beyond 
the scope of this work, in particular because its physical 
relevance is yet not fully clear. 



3.2 Two-dimensional percolation 

Again the discussion of the rate function for the distribu- 
tion of the relative size s = S/N of the largest component 
is started by considering the non-percolating regime. The 
rate function for p — 0.3 < p c (L = 32) is shown in Fig. 
[7] In principle it looks similar to the ER case displayed 
in Fig. [5] A striking difference is that it exhibits a large 
region where it behaves linearly, basically for half of the 
support, while it grows stronger for s > 0.5. For L = 16 
(not shown), the same result was found. This means, the 
finite-size effects are small as in the ER case. This also 
indicates that the shape of the rate function should basi- 
cally remain the same for L — > oo. In particular, it appears 
likely that for L — > oo, <£(s) will consist of a linear part 
for small values of s and it will grow stronger for s — > 1. 
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one) of the size of the largest component for ER random 
graphs (c = 2, N = 500) at artificial temperature T = 9.5. 
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Fig. 7. Large-deviation rate function <P(s) of two- 
dimensional (2d) bond percolation with occupation prob- 
ability p = 0.3 < p c , L = 32 (symbols). For small rela- 
tive cluster sizes s, the rate function behaves linearly (the 
line displays a linear function with slope 0.1125, obtained 
from fitting a linear function to the data in the range 
se [0,0.5]). 



In contrast, the large-deviation rate function right at 
the percolation transition p — p c = 0.5 (L = 32), see 
Fig. [SI looks very different from the ER case: It exhibits 
a minimum at a relative cluster size of about s = S/N = 
0.78, see main plot in Fig. HI This is very large compared 
to the ER case, but only a finite-size effect: For example 
for L = 512 (additional simple sampling simulations, not 



shown), the minimum of the rate function has already 
moved to a smaller value of s 0.5 and for p = 0.49, just 
before the percolation transition, the most likely relative 
size of a cluster is s — 0.1. Hence, the finite-size effects, 
i.e., corrections to the large-deviation rate function, are 
close to p c much stronger for the finite-dimensional case 
compared to ER random graphs. 

The case of the percolating regime is displayed in the 
inset of Fig. [5] Here again, the result looks very similar to 
the ER case, except that the magnitude of the rate func- 
tion is somehow smaller. Hence, it appears likely that the 
shape of the rate function for the 2d percolation problem 
in the limit L — > oo is very similar to the ER case, i.e., it 
may level off horizontally for s — > at a finite value and 
the strong increase found for L — 32 is again a finite size 
effect. This is confirmed by the result for L = 16, which 
exhibits a much stronger increase for s — > compared to 
the L = 32 result, as in the case of ER random graphs. 
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Fig. 8. Large-deviation rate function <P(s) of two- 
dimensional (2d) bond percolation with occupation prob- 
ability p — 0.5 = Pc, L — 32 and (inset) p = 0.6, 
L = i6/L = 32. 



In Fig. IH1 the result for the behaviour of S as a func- 
tion of the artificial temperature is shown. In the non- 
percolating regime (p = 0.3), see right inset, the average 
value s behaves very regularly as a function of the tem- 
perature, no sign of a transition is visible. On the other 
hand, inside the percolation regime (p = 0.6), see main 
plot, s(T) exhibits a strong increase around T — 27. The 
distribution of S at T = 27 exhibits a strong bimodal sig- 
nature, which indicates that indeed a phase transition of 
first-order type takes place. In summary, the temperature 
dependence of the size of the largest component is very 
similar to the results obtained for the ER random graphs. 
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Fig. 9. Average relative size s of the largest component 
as a function of artificial temperature T for 2d percolation 
(L = 32) in the percolating regime (jp = 0.6) and (right 
inset) in the non-percolating regime (p = 0.6). The left 
inset shows the distribution (normalised such that the in- 
tegral is one) of the size of the largest component (p = 6) 
at artificial temperature T — 27 (line is guide to the eyes 
only). 



For the 2d percolation problem, where no analytic re- 
sults are available, basically the same results are found: 
the shape of the large-deviation rate functions below, at, 
and above the percolation threshold are qualitatively the 
same, except that the finite-size corrections at the perco- 
lation threshold appear to be larger compared to the ER 
results. Also the behaviour of the largest-component size 
as a function of the temperature seems to be similar, in 
particular exhibiting a first-order type transition for the 
percolating regime. 

Since the comparison with the exact results for the ER 
random graphs indicates the usefulness of this approach 
to study large-deviation properties of random graphs, it 
appears promising to consider many other properties of 
different ensembles of random graphs in the same way. 
For example, it would be interesting to obtain the distri- 
bution of the diameter of ER random graphs, where only 
for c < 1 there is an analytic result available. Correspond- 
ing simulations are currently performed by the author of 
this work. 
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4 Summary and outlook 
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By using an artificial Boltzmann ensemble characterised 
by an artificial temperature T, the distributions of the size 
of the largest component for ER randoms graphs with 
finite connectivity c and for 2d percolation have been 
studied in this work. For not too large system sizes, the 
distributions can be calculated numerically over the full 
support, giving access to very small probabilities such as 

io- 180 . 

For the ER case, the numerical results for the large- 
deviation rate function <P(s), obtained for rather small 
graphs of size N = 500, agree very well with analytical 
results obtained previously for the leading behaviour in 
the limit N — > oo. This proves the usefulness of the nu- 
merical approach, which has been applied previously to 
models where no complete comparison between numeri- 
cal data and exact analytic results have been performed. 
The main findings are that below and at the percolation 
transition, <P(s) exhibits a minimum at s — and rises 
monotonously for s — > 1. Inside the percolating regime, 
<P(s) exhibits a minimum, grows quickly around this min- 
imum and levels off horizontally for s — > 0. The finite-size 
corrections are usually small, except for the percolating 
regime in an extended region near s = 0. Furthermore, 
when studying the average value s as a function of tem- 
perature, a transition of first-order type is found between 
a phase where S is untypically small to a phase where S 
is large. 
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